Saint Paul
In Photos: One Week Since the Shooting of Renee Nicole Good in Minneapolis
Protests across Minnesota--and around the country--are ongoing, as residents demonstrate against their federal government. It's been one week since a US Immigration and Customs Enforcement (ICE) agent shot and killed Renee Nicole Good, a resident of Minneapolis, Minnesota . Since then, the city has been in tumult. Thousands of protestors--from young students to elderly residents--have taken to the streets, setting up memorials for Good and facing off with ICE agents. More than 2,000 ICE agents have been deployed to Minneapolis, with another 1,000 on the way.
- North America > United States > Minnesota > Hennepin County > Minneapolis (1.00)
- South America > Venezuela (0.05)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.05)
- (5 more...)
Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation
Mohamadi, Mohamad Amin, Wang, Tianhao, Li, Zhiyuan
Modern language models fail a fundamental requirement of trustworthy intelligence: knowing when not to answer. Despite achieving impressive accuracy on benchmarks, these models produce confident hallucinations, even when wrong answers carry catastrophic consequences. Our evaluations on GSM8K, MedQA and GPQA show frontier models almost never abstain despite explicit warnings of severe penalties, suggesting that prompts cannot override training that rewards any answer over no answer. As a remedy, we propose Reinforced Hesitation (RH): a modification to Reinforcement Learning from Verifiable Rewards (RLVR) to use ternary rewards (+1 correct, 0 abstention, -$λ$ error) instead of binary. Controlled experiments on logic puzzles reveal that varying $λ$ produces distinct models along a Pareto frontier, where each training penalty yields the optimal model for its corresponding risk regime: low penalties produce aggressive answerers, high penalties conservative abstainers. We then introduce two inference strategies that exploit trained abstention as a coordination signal: cascading routes queries through models with decreasing risk tolerance, while self-cascading re-queries the same model on abstention. Both outperform majority voting with lower computational cost. These results establish abstention as a first-class training objective that transforms ``I don't know'' from failure into a coordination signal, enabling models to earn trust through calibrated honesty about their limits.
- Europe > Austria > Vienna (0.14)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- Asia > Thailand > Bangkok > Bangkok (0.04)
- (14 more...)
- Research Report > Experimental Study (0.68)
- Research Report > Strength High (0.54)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- (2 more...)
Fairness Evaluation of Large Language Models in Academic Library Reference Services
Wang, Haining, Clark, Jason, Yan, Yueru, Bradley, Star, Chen, Ruiyang, Zhang, Yiqiong, Fu, Hengyi, Tian, Zuoyu
As libraries explore large language models (LLMs) for use in virtual reference services, a key question arises: Can LLMs serve all users equitably, regardless of demographics or social status? While they offer great potential for scalable support, LLMs may also reproduce societal biases embedded in their training data, risking the integrity of libraries' commitment to equitable service. To address this concern, we evaluate whether LLMs differentiate responses across user identities by prompting six state-of-the-art LLMs to assist patrons differing in sex, race/ethnicity, and institutional role. We find no evidence of differentiation by race or ethnicity, and only minor evidence of stereotypical bias against women in one model. LLMs demonstrate nuanced accommodation of institutional roles through the use of linguistic choices related to formality, politeness, and domain-specific vocabularies, reflecting professional norms rather than discriminatory treatment. These findings suggest that current LLMs show a promising degree of readiness to support equitable and contextually appropriate communication in academic library reference services.
- Europe > Austria > Vienna (0.14)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > United States > Montana > Gallatin County > Bozeman (0.04)
- (17 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study > Negative Result (0.34)
- Health & Medicine (1.00)
- Education (1.00)
- Information Technology (0.68)
Autonomous generation of different courses of action in mechanized combat operations
Schubert, Johan, Hansen, Patrik, Hörling, Pontus, Johansson, Ronnie
In this paper, we propose a methodology designed to support decision-making during the execution phase of military ground combat operations, with a focus on one's actions. This methodology generates and evaluates recommendations for various courses of action for a mechanized battalion, commencing with an initial set assessed by their anticipated outcomes. It systematically produces thousands of individual action alternatives, followed by evaluations aimed at identifying alternative courses of action with superior outcomes. These alternatives are appraised in light of the opponent's status and actions, considering unit composition, force ratios, types of offense and defense, and anticipated advance rates. Field manuals evaluate battle outcomes and advancement rates. The processes of generation and evaluation work concurrently, yielding a variety of alternative courses of action. This approach facilitates the management of new course generation based on previously evaluated actions. As the combat unfolds and conditions evolve, revised courses of action are formulated for the decision-maker within a sequential decision-making framework.
- North America > United States > District of Columbia > Washington (0.04)
- North America > United States > Kansas > Leavenworth County > Leavenworth (0.04)
- North America > United States > Virginia > Fairfax County > Fairfax (0.04)
- (9 more...)
What to Do in St. Paul and Minneapolis If You're Here for Business (2025)
A convent turned hotel, Caribou Coffee, and progressive coworking space called The Coven--plus more things to see and do while on a business trip to Minneapolis and St. Paul. All products featured on WIRED are independently selected by our editors. However, we may receive compensation from retailers and/or from purchases of products through these links. Minnesota is the birthplace of the supercomputer, developed for code cracking during World War II. Tech giants of their day, including Cray Research and Control Data Corporation, were based in the Twin Cities.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.65)
- North America > United States > Wisconsin (0.04)
- North America > United States > North Carolina (0.04)
- (8 more...)
- Transportation (1.00)
- Leisure & Entertainment > Sports (1.00)
- Information Technology (1.00)
- (6 more...)
Fused Lasso Improves Accuracy of Co-occurrence Network Inference in Grouped Samples
Agyapong, Daniel, Beatty, Briana H., Kennedy, Peter G., Marks, Jane C., Hocking, Toby D.
Co-occurrence network inference algorithms have significantly advanced our understanding of microbiome communities. However, these algorithms typically analyze microbial associations within samples collected from a single environmental niche, often capturing only static snapshots rather than dynamic microbial processes. Previous studies have commonly grouped samples from different environmental niches together without fully considering how microbial communities adapt their associations when faced with varying ecological conditions. Our study addresses this limitation by explicitly investigating both spatial and temporal dynamics of microbial communities. We analyzed publicly available microbiome abundance data across multiple locations and time points, to evaluate algorithm performance in predicting microbial associations using our proposed Same-All Cross-validation (SAC) framework. SAC evaluates algorithms in two distinct scenarios: training and testing within the same environmental niche (Same), and training and testing on combined data from multiple environmental niches (All). To overcome the limitations of conventional algorithms, we propose fuser, an algorithm that, while not entirely new in machine learning, is novel for microbiome community network inference. It retains subsample-specific signals while simultaneously sharing relevant information across environments during training. Unlike standard approaches that infer a single generalized network from combined data, fuser generates distinct, environment-specific predictive networks. Our results demonstrate that fuser achieves comparable predictive performance to existing algorithms such as glmnet when evaluated within homogeneous environments (Same), and notably reduces test error compared to baseline algorithms in cross-environment (All) scenarios.
- North America > United States > Minnesota > Ramsey County > Saint Paul (0.14)
- North America > United States > Arizona > Coconino County > Flagstaff (0.04)
- Pacific Ocean (0.04)
- (3 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution
Qin, Tianrui, Chen, Qianben, Wang, Sinuo, Xing, He, Zhu, King, Zhu, He, Shi, Dingfeng, Liu, Xinxin, Zhang, Ge, Liu, Jiaheng, Jiang, Yuchen Eleanor, Gao, Xitong, Zhou, Wangchunshu
Large language models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks when equipped with external tools. However, current frameworks predominantly rely on sequential processing, leading to inefficient execution particularly for tasks requiring extensive tool interaction. This paper introduces Flash-Searcher, a novel parallel agent reasoning framework that fundamentally reimagines the execution paradigm from sequential chains to directed acyclic graphs (DAGs). Flash-Searcher decomposes complex tasks into subtasks with explicit dependencies, enabling concurrent execution of independent reasoning paths while maintaining logical constraints. Through dynamic workflow optimization, our framework continuously refines the execution graph based on intermediate results, effectively integrating summary module. Comprehensive evaluations across multiple benchmarks demonstrate that Flash-Searcher consistently outperforms existing approaches. Specifically, it achieves 67.7% accuracy on BrowseComp and 83% on xbench-DeepSearch, while reducing agent execution steps by up to 35% compared to current frameworks. Furthermore, when distilling this parallel reasoning pipeline into single models, we observe substantial performance gains across diverse backbone architectures, underscoring the generalizability of our methodology. Our work thus represents a significant advance in agent architecture design, offering a more scalable and efficient paradigm for complex reasoning tasks.
- Asia > Russia (0.45)
- Europe > Russia (0.27)
- South America > Brazil (0.14)
- (30 more...)
- Workflow (1.00)
- Research Report > New Finding (0.67)
- Leisure & Entertainment (0.93)
- Media > Music (0.68)
- Government (0.67)
Between proportionnality and envy-freeness: k-proportionality
This article deals with the cake cutting problem. In this setting, there exists two notions of fair division: proportional division (when there are n players, each player thinks to get at least 1/n of the cake) and envy-free division (each player wants to keep his or her share because he or she does not envy the portion given to another player). Some results are valid for proportional division but not for envy-free division. Here, we introduce and study a scale between the proportional division and the envy-free division. The goal is to understand where is the gap between statements about proportional division and envy-free division. This scale comes from the notion introduced in this article: k-proportionality. When k = n this notion corresponds to the proportional division and when k = 2 it corresponds to envy-free division. With k-proportionality we can understand where some difficulties in fair division lie. First, we show that there are situations in which there is no k-proportional and equitable division of a pie with connected pieces when k $\le$ n -1. This result was known only for envy-free division, ie k = 2. Next, we prove that there are situations in which there is no Pareto-optimal k-proportional division of a cake with connected pieces when k $\le$ n -1. This result was known only for k = 2. These theorems say that we can get an impossibility result even if we do not consider an envy-free division but a weaker notion. Finally, k-proportionality allows to give a generalization with a uniform statement of theorems about strong envy-free and strong proportional divisions.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
- North America > United States > Minnesota > Ramsey County > Saint Paul (0.04)
Outbreak of 'Frankenstein' rabbits with face tentacles now poses threat to HUMANS: Doctor warns which states disease will spread to next
More'Frankenstein' rabbits are appearing across the US, sparking fears of a wider outbreak. Originally spotted in Colorado, these bizarre rabbits, with tentacle-like growths sprouting from their faces, have now been reported in Minnesota, Nebraska, and South Dakota. The animals are infected with cottontail rabbit papilloma virus (CRPV), also known as Shope papilloma virus, which can be spread through mosquito and tick bites. While humans are unlikely to contract CRPV, Dr Omer Awan of the University of Maryland School of Medicine cautioned that people could still face risks from other diseases carried by ticks or mosquitoes that have fed on infected rabbits. 'You're not going to get CRPV, and you likely won't show symptoms of it,' Dr Awan told the Daily Mail.
- North America > United States > Nebraska (0.25)
- North America > United States > Maryland (0.25)
- North America > United States > Rocky Mountains (0.06)
- (6 more...)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.73)
Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success
Wang, Che, van Baar, Jeroen, Mitash, Chaitanya, Li, Shuai, Randle, Dylan, Wang, Weiyao, Sontakke, Sumedh, Bekris, Kostas E., Katyal, Kapil
This work demonstrates how autonomously learning aspects of robotic operation from sparsely-labeled, real-world data of deployed, engineered solutions at industrial scale can provide with solutions that achieve improved performance. Specifically, it focuses on multi-suction robot picking and performs a comprehensive study on the application of multi-modal visual encoders for predicting the success of candidate robotic picks. Picking diverse items from unstructured piles is an important and challenging task for robot manipulation in real-world settings, such as warehouses. Methods for picking from clutter must work for an open set of items while simultaneously meeting latency constraints to achieve high throughput. The demonstrated approach utilizes multiple input modalities, such as RGB, depth and semantic segmentation, to estimate the quality of candidate multi-suction picks. The strategy is trained from real-world item picking data, with a combination of multimodal pretrain and finetune. The manuscript provides comprehensive experimental evaluation performed over a large item-picking dataset, an item-picking dataset targeted to include partial occlusions, and a package-picking dataset, which focuses on containers, such as boxes and envelopes, instead of unpackaged items. The evaluation measures performance for different item configurations, pick scenes, and object types. Ablations help to understand the effects of in-domain pretraining, the impact of different modalities and the importance of finetuning. These ablations reveal both the importance of training over multiple modalities but also the ability of models to learn during pretraining the relationship between modalities so that during finetuning and inference, only a subset of them can be used as input.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- Asia > South Korea > Daegu > Daegu (0.04)
- (8 more...)